Dataset statistics
| Number of variables | 34 |
|---|---|
| Number of observations | 19997 |
| Missing cells | 8844 |
| Missing cells (%) | 1.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 5.2 MiB |
| Average record size in memory | 272.0 B |
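The derived figures in the table above can be re-computed from the raw counts. The sketch below assumes that every stored value occupies 8 bytes and ignores index overhead; that is an assumption, although it is consistent with the 156.2 KiB reported for each column later in the report.

```python
# Re-derive the overview figures from the raw counts in the table above.
n_variables = 34
n_observations = 19997
missing_cells = 8844

total_cells = n_variables * n_observations        # every (row, column) pair
missing_pct = 100 * missing_cells / total_cells   # share of empty cells

# Assumption: 8 bytes per stored value (int64, float64 or an object pointer).
bytes_per_value = 8
record_bytes = n_variables * bytes_per_value      # one row across all columns
column_kib = n_observations * bytes_per_value / 1024
total_mib = record_bytes * n_observations / 1024 ** 2

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {record_bytes} B")
print(f"Per-column memory: {column_kib:.1f} KiB")
print(f"Total memory: {total_mib:.1f} MiB")
```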
Variable types
| NUM | 13 |
|---|---|
| CAT | 13 |
| BOOL | 5 |
| DATE | 2 |
| UNSUPPORTED | 1 |
Reproduction
| Analysis started | 2020-09-06 14:49:24.596157 |
|---|---|
| Analysis finished | 2020-09-06 14:50:24.713062 |
| Duration | 1 minute and 0.12 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Warnings
| Warning | Type |
|---|---|
| first_name has a high cardinality: 2839 distinct values | High cardinality |
| last_name has a high cardinality: 3267 distinct values | High cardinality |
| job_title has a high cardinality: 195 distinct values | High cardinality |
| address has a high cardinality: 3487 distinct values | High cardinality |
| Age Scale is highly correlated with Age | High correlation |
| Age is highly correlated with Age Scale | High correlation |
| online_order has 360 (1.8%) missing values | Missing |
| last_name has 642 (3.2%) missing values | Missing |
| DOB has 446 (2.2%) missing values | Missing |
| job_title has 2394 (12.0%) missing values | Missing |
| job_industry_category has 3229 (16.1%) missing values | Missing |
| tenure has 446 (2.2%) missing values | Missing |
| transaction_id has unique values | Unique |
| DOB is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
| product_id has 1375 (6.9%) zeros | Zeros |
transaction_id
Real number (ℝ≥0)
| Distinct count | 19997 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9999.856078411762 |
|---|---|
| Minimum | 1 |
| Maximum | 20000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1000.8 |
| Q1 | 5000 |
| median | 10000 |
| Q3 | 14999 |
| 95-th percentile | 19000.2 |
| Maximum | 20000 |
| Range | 19999 |
| Interquartile range (IQR) | 9999 |
Descriptive statistics
| Standard deviation | 5773.636854 |
|---|---|
| Coefficient of variation (CV) | 0.5773719951 |
| Kurtosis | -1.199948114 |
| Mean | 9999.856078 |
| Median Absolute Deviation (MAD) | 5000 |
| Skewness | 0.0001487462264 |
| Sum | 199967122 |
| Variance | 33334882.52 |
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 10912 | 1 | < 0.1% | |
| 12947 | 1 | < 0.1% | |
| 2708 | 1 | < 0.1% | |
| 661 | 1 | < 0.1% | |
| 6806 | 1 | < 0.1% | |
| 4759 | 1 | < 0.1% | |
| 19100 | 1 | < 0.1% | |
| 17053 | 1 | < 0.1% | |
| 8865 | 1 | < 0.1% | |
| Other values (19987) | 19987 | 99.9% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 20000 | 1 | < 0.1% | |
| 19999 | 1 | < 0.1% | |
| 19998 | 1 | < 0.1% | |
| 19997 | 1 | < 0.1% | |
| 19996 | 1 | < 0.1% |
product_id
Real number (ℝ≥0)
| Distinct count | 101 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45.37145571835775 |
|---|---|
| Minimum | 0 |
| Maximum | 100 |
| Zeros | 1375 |
| Zeros (%) | 6.9% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 18 |
| median | 44 |
| Q3 | 72 |
| 95-th percentile | 94 |
| Maximum | 100 |
| Range | 100 |
| Interquartile range (IQR) | 54 |
Descriptive statistics
| Standard deviation | 30.75087583 |
|---|---|
| Coefficient of variation (CV) | 0.6777581929 |
| Kurtosis | -1.247667283 |
| Mean | 45.37145572 |
| Median Absolute Deviation (MAD) | 27 |
| Skewness | 0.08167230031 |
| Sum | 907293 |
| Variance | 945.6163646 |
| Value | Count | Frequency (%) | |
| 0 | 1375 | 6.9% | |
| 3 | 354 | 1.8% | |
| 1 | 311 | 1.6% | |
| 35 | 268 | 1.3% | |
| 38 | 267 | 1.3% | |
| 4 | 241 | 1.2% | |
| 2 | 240 | 1.2% | |
| 90 | 225 | 1.1% | |
| 12 | 224 | 1.1% | |
| 80 | 223 | 1.1% | |
| Other values (91) | 16269 | 81.4% |
| Value | Count | Frequency (%) | |
| 0 | 1375 | 6.9% | |
| 1 | 311 | 1.6% | |
| 2 | 240 | 1.2% | |
| 3 | 354 | 1.8% | |
| 4 | 241 | 1.2% |
| Value | Count | Frequency (%) | |
| 100 | 130 | 0.7% | |
| 99 | 152 | 0.8% | |
| 98 | 156 | 0.8% | |
| 97 | 142 | 0.7% | |
| 96 | 161 | 0.8% |
customer_id
Real number (ℝ≥0)
| Distinct count | 3493 |
|---|---|
| Unique (%) | 17.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1737.7516127419112 |
|---|---|
| Minimum | 1 |
| Maximum | 3500 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 172 |
| Q1 | 857 |
| median | 1736 |
| Q3 | 2613 |
| 95-th percentile | 3320 |
| Maximum | 3500 |
| Range | 3499 |
| Interquartile range (IQR) | 1756 |
Descriptive statistics
| Standard deviation | 1011.221384 |
|---|---|
| Coefficient of variation (CV) | 0.5819136499 |
| Kurtosis | -1.203543648 |
| Mean | 1737.751613 |
| Median Absolute Deviation (MAD) | 878 |
| Skewness | 0.008740971946 |
| Sum | 34749819 |
| Variance | 1022568.687 |
| Value | Count | Frequency (%) | |
| 2183 | 14 | 0.1% | |
| 1068 | 14 | 0.1% | |
| 2476 | 14 | 0.1% | |
| 2072 | 13 | 0.1% | |
| 637 | 13 | 0.1% | |
| 1672 | 13 | 0.1% | |
| 1946 | 13 | 0.1% | |
| 3232 | 13 | 0.1% | |
| 1140 | 13 | 0.1% | |
| 2912 | 13 | 0.1% | |
| Other values (3483) | 19864 | 99.3% |
| Value | Count | Frequency (%) | |
| 1 | 11 | 0.1% | |
| 2 | 3 | < 0.1% | |
| 3 | 8 | < 0.1% | |
| 4 | 2 | < 0.1% | |
| 5 | 6 | < 0.1% |
| Value | Count | Frequency (%) | |
| 3500 | 6 | < 0.1% | |
| 3499 | 7 | < 0.1% | |
| 3498 | 6 | < 0.1% | |
| 3497 | 3 | < 0.1% | |
| 3496 | 4 | < 0.1% |
transaction_date
Date
| Distinct count | 364 |
|---|---|
| Unique (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.2 KiB |
| Minimum | 2017-01-01 00:00:00 |
|---|---|
| Maximum | 2017-12-30 00:00:00 |
online_order
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 360 |
| Missing (%) | 1.8% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 9829 | 49.2% | |
| 0 | 9808 | 49.0% | |
| (Missing) | 360 | 1.8% |
order_status
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| Approved | 19818 | 99.1% | |
| Cancelled | 179 | 0.9% |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 8.008951343 |
| Min length | 8 |
brand
Categorical
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 197 |
| Missing (%) | 1.0% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| Solex | 4252 | 21.3% | |
| Giant Bicycles | 3312 | 16.6% | |
| WeareA2B | 3295 | 16.5% | |
| OHM Cycles | 3042 | 15.2% | |
| Trek Bicycles | 2990 | 15.0% | |
| Norco Bicycles | 2909 | 14.5% | |
| (Missing) | 197 | 1.0% |
Length
| Max length | 14 |
|---|---|
| Median length | 10 |
| Mean length | 10.23128469 |
| Min length | 3 |
product_line
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 197 |
| Missing (%) | 1.0% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| Standard | 14175 | 70.9% | |
| Road | 3968 | 19.8% | |
| Touring | 1234 | 6.2% | |
| Mountain | 423 | 2.1% | |
| (Missing) | 197 | 1.0% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.095314297 |
| Min length | 3 |
product_class
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 197 |
| Missing (%) | 1.0% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| medium | 13823 | 69.1% | |
| high | 3013 | 15.1% | |
| low | 2964 | 14.8% | |
| (Missing) | 197 | 1.0% |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.224433665 |
| Min length | 3 |
product_size
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 197 |
| Missing (%) | 1.0% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| medium | 12987 | 64.9% | |
| large | 3976 | 19.9% | |
| small | 2837 | 14.2% | |
| (Missing) | 197 | 1.0% |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.629744462 |
| Min length | 3 |
list_price
Real number (ℝ≥0)
| Distinct count | 296 |
|---|---|
| Unique (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1107.919640946142 |
|---|---|
| Minimum | 12.01 |
| Maximum | 2091.47 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 12.01 |
|---|---|
| 5-th percentile | 100.35 |
| Q1 | 575.27 |
| median | 1163.89 |
| Q3 | 1635.3 |
| 95-th percentile | 1992.93 |
| Maximum | 2091.47 |
| Range | 2079.46 |
| Interquartile range (IQR) | 1060.03 |
Descriptive statistics
| Standard deviation | 582.8187868 |
|---|---|
| Coefficient of variation (CV) | 0.5260478877 |
| Kurtosis | -1.083023045 |
| Mean | 1107.919641 |
| Median Absolute Deviation (MAD) | 521.58 |
| Skewness | -0.1260900774 |
| Sum | 22155069.06 |
| Variance | 339677.7383 |
| Value | Count | Frequency (%) | |
| 2091.47 | 465 | 2.3% | |
| 1403.5 | 396 | 2.0% | |
| 71.49 | 274 | 1.4% | |
| 1231.15 | 235 | 1.2% | |
| 1890.39 | 233 | 1.2% | |
| 1129.13 | 232 | 1.2% | |
| 1073.07 | 229 | 1.1% | |
| 1894.19 | 228 | 1.1% | |
| 945.04 | 226 | 1.1% | |
| 574.64 | 223 | 1.1% | |
| Other values (286) | 17256 | 86.3% |
| Value | Count | Frequency (%) | |
| 12.01 | 195 | 1.0% | |
| 16.08 | 1 | < 0.1% | |
| 26.15 | 1 | < 0.1% | |
| 32.44 | 1 | < 0.1% | |
| 36.78 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 2091.47 | 465 | 2.3% | |
| 2086.07 | 1 | < 0.1% | |
| 2083.94 | 208 | 1.0% | |
| 2076.81 | 1 | < 0.1% | |
| 2064.08 | 1 | < 0.1% |
standard_cost
Real number (ℝ≥0)
| Distinct count | 100 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 197 |
| Missing (%) | 1.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 556.0680474747475 |
|---|---|
| Minimum | 7.21 |
| Maximum | 1759.85 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 7.21 |
|---|---|
| 5-th percentile | 53.62 |
| Q1 | 215.14 |
| median | 507.58 |
| Q3 | 795.1 |
| 95-th percentile | 1479.11 |
| Maximum | 1759.85 |
| Range | 1752.64 |
| Interquartile range (IQR) | 579.96 |
Descriptive statistics
| Standard deviation | 405.9768807 |
|---|---|
| Coefficient of variation (CV) | 0.7300848926 |
| Kurtosis | 0.2867003197 |
| Mean | 556.0680475 |
| Median Absolute Deviation (MAD) | 287.52 |
| Skewness | 0.864008653 |
| Sum | 11010147.34 |
| Variance | 164817.2277 |
| Value | Count | Frequency (%) | |
| 388.92 | 465 | 2.3% | |
| 954.82 | 396 | 2.0% | |
| 53.62 | 274 | 1.4% | |
| 161.6 | 235 | 1.2% | |
| 260.14 | 233 | 1.2% | |
| 677.48 | 232 | 1.2% | |
| 933.84 | 229 | 1.1% | |
| 598.76 | 228 | 1.1% | |
| 507.58 | 226 | 1.1% | |
| 459.71 | 223 | 1.1% | |
| Other values (90) | 17059 | 85.3% |
| Value | Count | Frequency (%) | |
| 7.21 | 195 | 1.0% | |
| 13.44 | 187 | 0.9% | |
| 44.71 | 198 | 1.0% | |
| 45.26 | 188 | 0.9% | |
| 53.62 | 274 | 1.4% |
| Value | Count | Frequency (%) | |
| 1759.85 | 195 | 1.0% | |
| 1610.9 | 200 | 1.0% | |
| 1580.47 | 190 | 1.0% | |
| 1531.42 | 169 | 0.8% | |
| 1516.13 | 185 | 0.9% |
Margin
Real number (ℝ≥0)
| Distinct count | 296 |
|---|---|
| Unique (%) | 1.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 557.329685452818 |
|---|---|
| Minimum | 4.8 |
| Maximum | 2086.07 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 4.8 |
|---|---|
| 5-th percentile | 25.09 |
| Q1 | 135.85 |
| median | 445.21 |
| Q3 | 830.24 |
| 95-th percentile | 1612.25 |
| Maximum | 2086.07 |
| Range | 2081.27 |
| Interquartile range (IQR) | 694.39 |
Descriptive statistics
| Standard deviation | 497.2945398 |
|---|---|
| Coefficient of variation (CV) | 0.8922807322 |
| Kurtosis | -0.4062154187 |
| Mean | 557.3296855 |
| Median Absolute Deviation (MAD) | 330.28 |
| Skewness | 0.8451109374 |
| Sum | 11144921.72 |
| Variance | 247301.8593 |
| Value | Count | Frequency (%) | |
| 1702.55 | 465 | 2.3% | |
| 448.68 | 396 | 2.0% | |
| 17.87 | 274 | 1.4% | |
| 1069.55 | 235 | 1.2% | |
| 1630.25 | 233 | 1.2% | |
| 451.65 | 232 | 1.2% | |
| 139.23 | 229 | 1.1% | |
| 1295.43 | 228 | 1.1% | |
| 437.46 | 226 | 1.1% | |
| 114.93 | 223 | 1.1% | |
| Other values (286) | 17256 | 86.3% |
| Value | Count | Frequency (%) | |
| 4.8 | 195 | 1.0% | |
| 14.23 | 163 | 0.8% | |
| 15.08 | 188 | 0.9% | |
| 16.08 | 1 | < 0.1% | |
| 17.87 | 274 | 1.4% |
| Value | Count | Frequency (%) | |
| 2086.07 | 1 | < 0.1% | |
| 2076.81 | 1 | < 0.1% | |
| 2064.08 | 1 | < 0.1% | |
| 2062.95 | 1 | < 0.1% | |
| 2061.38 | 1 | < 0.1% |
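The sample rows at the end of this report suggest that Margin equals list_price minus standard_cost, and that a missing standard_cost is counted as zero, which would explain why Margin has no missing values while standard_cost has 197. The sketch below encodes that reading. The function name and the zero-fill rule are assumptions inferred from the data, not documented behaviour of the pipeline that produced the column.

```python
import math

def margin(list_price, standard_cost):
    """Hypothetical reconstruction: list price minus cost, missing cost counted as 0."""
    if standard_cost is None or math.isnan(standard_cost):
        standard_cost = 0.0
    # Round to cents to hide binary floating-point noise.
    return round(list_price - standard_cost, 2)

# Input values taken from the first rows of the dataset.
print(margin(2083.94, 675.03))        # row 0
print(margin(1231.15, 161.60))        # row 1
print(margin(1034.17, float("nan")))  # row 2, where standard_cost is NaN
```

If this reading is correct, rows with a missing cost overstate the margin, so they may deserve separate handling before any margin-based analysis.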
product_first_sold_date
Date
| Distinct count | 100 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 197 |
| Missing (%) | 1.0% |
| Memory size | 156.2 KiB |
| Minimum | 1991-01-21 00:00:00 |
|---|---|
| Maximum | 2016-12-06 00:00:00 |
first_name
Categorical
| Distinct count | 2839 |
|---|---|
| Unique (%) | 14.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| Corabelle | 36 | 0.2% | |
| Tobe | 31 | 0.2% | |
| Emlyn | 29 | 0.1% | |
| Lindsay | 28 | 0.1% | |
| Gar | 26 | 0.1% | |
| Max | 26 | 0.1% | |
| Catie | 26 | 0.1% | |
| Keeley | 25 | 0.1% | |
| Hubie | 24 | 0.1% | |
| Ebba | 24 | 0.1% | |
| Other values (2829) | 19722 | 98.6% |
Length
| Max length | 14 |
|---|---|
| Median length | 6 |
| Mean length | 5.960194029 |
| Min length | 2 |
last_name
Categorical
| Distinct count | 3267 |
|---|---|
| Unique (%) | 16.9% |
| Missing | 642 |
| Missing (%) | 3.2% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| Gladman | 24 | 0.1% | |
| Fyndon | 23 | 0.1% | |
| Leek | 18 | 0.1% | |
| Elgey | 18 | 0.1% | |
| Ramsdell | 18 | 0.1% | |
| Creebo | 18 | 0.1% | |
| Mulliner | 17 | 0.1% | |
| Alpes | 17 | 0.1% | |
| Lithgow | 17 | 0.1% | |
| Pristnor | 17 | 0.1% | |
| Other values (3257) | 19168 | 95.9% | |
| (Missing) | 642 | 3.2% |
Length
| Max length | 19 |
|---|---|
| Median length | 7 |
| Mean length | 6.889983498 |
| Min length | 2 |
gender_encoded
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 10549 | 52.8% | |
| 1 | 9448 | 47.2% |
past_3_years_bike_related_purchases
Real number (ℝ≥0)
| Distinct count | 100 |
|---|---|
| Unique (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48.772465869880484 |
|---|---|
| Minimum | 0 |
| Maximum | 99 |
| Zeros | 188 |
| Zeros (%) | 0.9% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 24 |
| median | 48 |
| Q3 | 73 |
| 95-th percentile | 95 |
| Maximum | 99 |
| Range | 99 |
| Interquartile range (IQR) | 49 |
Descriptive statistics
| Standard deviation | 28.59825009 |
|---|---|
| Coefficient of variation (CV) | 0.5863605536 |
| Kurtosis | -1.17658005 |
| Mean | 48.77246587 |
| Median Absolute Deviation (MAD) | 25 |
| Skewness | 0.05797703879 |
| Sum | 975303 |
| Variance | 817.859908 |
| Value | Count | Frequency (%) | |
| 16 | 291 | 1.5% | |
| 80 | 273 | 1.4% | |
| 48 | 257 | 1.3% | |
| 20 | 256 | 1.3% | |
| 2 | 256 | 1.3% | |
| 67 | 255 | 1.3% | |
| 13 | 254 | 1.3% | |
| 83 | 250 | 1.3% | |
| 19 | 250 | 1.3% | |
| 53 | 250 | 1.3% | |
| Other values (90) | 17405 | 87.0% |
| Value | Count | Frequency (%) | |
| 0 | 188 | 0.9% | |
| 1 | 176 | 0.9% | |
| 2 | 256 | 1.3% | |
| 3 | 133 | 0.7% | |
| 4 | 178 | 0.9% |
| Value | Count | Frequency (%) | |
| 99 | 214 | 1.1% | |
| 98 | 246 | 1.2% | |
| 97 | 216 | 1.1% | |
| 96 | 229 | 1.1% | |
| 95 | 141 | 0.7% |
Age
Real number (ℝ≥0)
| Distinct count | 57 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44.31814772215832 |
|---|---|
| Minimum | 18 |
| Maximum | 150 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 22 |
| Q1 | 33 |
| median | 43 |
| Q3 | 53 |
| 95-th percentile | 65 |
| Maximum | 150 |
| Range | 132 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 17.06849624 |
|---|---|
| Coefficient of variation (CV) | 0.3851355961 |
| Kurtosis | 6.969850922 |
| Mean | 44.31814772 |
| Median Absolute Deviation (MAD) | 10 |
| Skewness | 1.891374652 |
| Sum | 886230 |
| Variance | 291.3335639 |
| Value | Count | Frequency (%) | |
| 42 | 1349 | 6.7% | |
| 43 | 905 | 4.5% | |
| 46 | 705 | 3.5% | |
| 44 | 664 | 3.3% | |
| 41 | 631 | 3.2% | |
| 45 | 597 | 3.0% | |
| 40 | 594 | 3.0% | |
| 39 | 526 | 2.6% | |
| 47 | 498 | 2.5% | |
| 34 | 494 | 2.5% | |
| Other values (47) | 13034 | 65.2% |
| Value | Count | Frequency (%) | |
| 18 | 104 | 0.5% | |
| 19 | 145 | 0.7% | |
| 20 | 242 | 1.2% | |
| 21 | 359 | 1.8% | |
| 22 | 333 | 1.7% |
| Value | Count | Frequency (%) | |
| 150 | 9 | < 0.1% | |
| 120 | 446 | 2.2% | |
| 88 | 10 | 0.1% | |
| 85 | 5 | < 0.1% | |
| 79 | 3 | < 0.1% |
Age Scale
Real number (ℝ≥0)
| Distinct count | 57 |
|---|---|
| Unique (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.29545431814772216 |
|---|---|
| Minimum | 0.12 |
| Maximum | 1.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 0.12 |
|---|---|
| 5-th percentile | 0.1466666667 |
| Q1 | 0.22 |
| median | 0.2866666667 |
| Q3 | 0.3533333333 |
| 95-th percentile | 0.4333333333 |
| Maximum | 1 |
| Range | 0.88 |
| Interquartile range (IQR) | 0.1333333333 |
Descriptive statistics
| Standard deviation | 0.1137899749 |
|---|---|
| Coefficient of variation (CV) | 0.3851355961 |
| Kurtosis | 6.969850922 |
| Mean | 0.2954543181 |
| Median Absolute Deviation (MAD) | 0.06666666667 |
| Skewness | 1.891374652 |
| Sum | 5908.2 |
| Variance | 0.0129481584 |
| Value | Count | Frequency (%) | |
| 0.28 | 1349 | 6.7% | |
| 0.2866666667 | 905 | 4.5% | |
| 0.3066666667 | 705 | 3.5% | |
| 0.2933333333 | 664 | 3.3% | |
| 0.2733333333 | 631 | 3.2% | |
| 0.3 | 597 | 3.0% | |
| 0.2666666667 | 594 | 3.0% | |
| 0.26 | 526 | 2.6% | |
| 0.3133333333 | 498 | 2.5% | |
| 0.2266666667 | 494 | 2.5% | |
| Other values (47) | 13034 | 65.2% |
| Value | Count | Frequency (%) | |
| 0.12 | 104 | 0.5% | |
| 0.1266666667 | 145 | 0.7% | |
| 0.1333333333 | 242 | 1.2% | |
| 0.14 | 359 | 1.8% | |
| 0.1466666667 | 333 | 1.7% |
| Value | Count | Frequency (%) | |
| 1 | 9 | < 0.1% | |
| 0.8 | 446 | 2.2% | |
| 0.5866666667 | 10 | 0.1% | |
| 0.5666666667 | 5 | < 0.1% | |
| 0.5266666667 | 3 | < 0.1% |
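The two preceding sets of tables line up value for value (42 and 0.28, 18 and 0.12, 150 and 1, each with identical counts), which suggests that Age Scale is simply Age divided by the maximum age of 150. This is an inference from the figures, not something the report states. Under that assumption, the sketch below illustrates why both columns report the same coefficient of variation, skewness and kurtosis and are flagged as highly correlated: dividing by a positive constant rescales the mean and the standard deviation by the same factor.

```python
import statistics

MAX_AGE = 150  # assumed scaling constant (the maximum of the Age column)

def age_scale(age):
    """Assumed relationship: Age Scale = Age / 150."""
    return age / MAX_AGE

def cv(values):
    """Coefficient of variation: standard deviation divided by the mean."""
    return statistics.stdev(values) / statistics.mean(values)

# Spot-check against values listed in the two sets of frequency tables.
for age in (18, 42, 43, 120, 150):
    print(age, round(age_scale(age), 10))

# A small illustrative sample (not the real data): scaling leaves the CV unchanged.
ages = [18, 22, 33, 43, 53, 65, 120, 150]
scaled = [age_scale(a) for a in ages]
print(abs(cv(ages) - cv(scaled)) < 1e-9)
```

Because one column is a fixed multiple of the other, keeping both adds no information for modelling.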
job_title
Categorical
| Distinct count | 195 |
|---|---|
| Unique (%) | 1.1% |
| Missing | 2394 |
| Missing (%) | 12.0% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| Social Worker | 226 | 1.1% | |
| Legal Assistant | 221 | 1.1% | |
| Business Systems Development Analyst | 221 | 1.1% | |
| Assistant Professor | 212 | 1.1% | |
| Executive Secretary | 208 | 1.0% | |
| Internal Auditor | 207 | 1.0% | |
| Nuclear Power Engineer | 205 | 1.0% | |
| Tax Accountant | 197 | 1.0% | |
| Administrative Officer | 196 | 1.0% | |
| Chemical Engineer | 195 | 1.0% | |
| Other values (185) | 15515 | 77.6% | |
| (Missing) | 2394 | 12.0% |
Length
| Max length | 36 |
|---|---|
| Median length | 17 |
| Mean length | 16.40761114 |
| Min length | 3 |
job_industry_category
Categorical
| Distinct count | 9 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 3229 |
| Missing (%) | 16.1% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| Manufacturing | 4014 | 20.1% | |
| Financial Services | 3886 | 19.4% | |
| Health | 3099 | 15.5% | |
| Retail | 1758 | 8.8% | |
| Property | 1297 | 6.5% | |
| IT | 1084 | 5.4% | |
| Entertainment | 698 | 3.5% | |
| Argiculture | 578 | 2.9% | |
| Telecommunications | 354 | 1.8% | |
| (Missing) | 3229 | 16.1% |
Length
| Max length | 18 |
|---|---|
| Median length | 8 |
| Mean length | 9.766815022 |
| Min length | 2 |
job_industry_category_encoded
Real number (ℝ)
| Distinct count | 10 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -0.41421213181977296 |
|---|---|
| Minimum | -5 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | -5 |
|---|---|
| 5-th percentile | -5 |
| Q1 | -3 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 5 |
| Range | 10 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 2.976688795 |
|---|---|
| Coefficient of variation (CV) | -7.186387279 |
| Kurtosis | -1.443477946 |
| Mean | -0.4142121318 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.003231128853 |
| Sum | -8283 |
| Variance | 8.860676181 |
| Value | Count | Frequency (%) | |
| 1 | 4014 | 20.1% | |
| -3 | 3886 | 19.4% | |
| 2 | 3229 | 16.1% | |
| -4 | 3099 | 15.5% | |
| 4 | 1758 | 8.8% | |
| 3 | 1297 | 6.5% | |
| -5 | 1084 | 5.4% | |
| -2 | 698 | 3.5% | |
| -1 | 578 | 2.9% | |
| 5 | 354 | 1.8% |
| Value | Count | Frequency (%) | |
| -5 | 1084 | 5.4% | |
| -4 | 3099 | 15.5% | |
| -3 | 3886 | 19.4% | |
| -2 | 698 | 3.5% | |
| -1 | 578 | 2.9% |
| Value | Count | Frequency (%) | |
| 5 | 354 | 1.8% | |
| 4 | 1758 | 8.8% | |
| 3 | 1297 | 6.5% | |
| 2 | 3229 | 16.1% | |
| 1 | 4014 | 20.1% |
wealth_segment_encoded
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| -1 | 10141 | 50.7% | |
| 1 | 5049 | 25.2% | |
| 0 | 4807 | 24.0% |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.507126069 |
| Min length | 1 |
deceased_indicator
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| N | 19989 | > 99.9% | |
| Y | 8 | < 0.1% |
owns_car_encoded
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| 0 | 10012 | 50.1% | |
| 1 | 9985 | 49.9% |
tenure
Real number (ℝ≥0)
| Distinct count | 22 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 446 |
| Missing (%) | 2.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.683238709017441 |
|---|---|
| Minimum | 1.0 |
| Maximum | 22.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 6 |
| median | 11 |
| Q3 | 15 |
| 95-th percentile | 20 |
| Maximum | 22 |
| Range | 21 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 5.676402887 |
|---|---|
| Coefficient of variation (CV) | 0.5313372697 |
| Kurtosis | -1.069951904 |
| Mean | 10.68323871 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.04361215577 |
| Sum | 208868 |
| Variance | 32.22154974 |
| Value | Count | Frequency (%) | |
| 7 | 1190 | 6.0% | |
| 5 | 1096 | 5.5% | |
| 11 | 1096 | 5.5% | |
| 16 | 1067 | 5.3% | |
| 12 | 1060 | 5.3% | |
| 8 | 1032 | 5.2% | |
| 14 | 1019 | 5.1% | |
| 9 | 995 | 5.0% | |
| 17 | 985 | 4.9% | |
| 10 | 985 | 4.9% | |
| Other values (12) | 9026 | 45.1% |
| Value | Count | Frequency (%) | |
| 1 | 876 | 4.4% | |
| 2 | 736 | 3.7% | |
| 3 | 819 | 4.1% | |
| 4 | 929 | 4.6% | |
| 5 | 1096 | 5.5% |
| Value | Count | Frequency (%) | |
| 22 | 255 | 1.3% | |
| 21 | 275 | 1.4% | |
| 20 | 498 | 2.5% | |
| 19 | 837 | 4.2% | |
| 18 | 959 | 4.8% |
address
Categorical
| Distinct count | 3487 |
|---|---|
| Unique (%) | 17.5% |
| Missing | 29 |
| Missing (%) | 0.1% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| 3 Talisman Place | 14 | 0.1% | |
| 8142 Tomscot Drive | 14 | 0.1% | |
| 567 Scott Park | 14 | 0.1% | |
| 3 Mariners Cove Terrace | 14 | 0.1% | |
| 4297 Emmet Lane | 14 | 0.1% | |
| 8587 Graceland Way | 13 | 0.1% | |
| 7916 Clyde Gallagher Place | 13 | 0.1% | |
| 3126 Butterfield Pass | 13 | 0.1% | |
| 259 Barnett Crossing | 13 | 0.1% | |
| 1 Nevada Park | 13 | 0.1% | |
| Other values (3477) | 19833 | 99.2% | |
| (Missing) | 29 | 0.1% |
Length
| Max length | 29 |
|---|---|
| Median length | 18 |
| Mean length | 17.68090214 |
| Min length | 3 |
postcode
Real number (ℝ≥0)
| Distinct count | 835 |
|---|---|
| Unique (%) | 4.2% |
| Missing | 29 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2987.623347355769 |
|---|---|
| Minimum | 2000.0 |
| Maximum | 4883.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 2000 |
|---|---|
| 5-th percentile | 2047 |
| Q1 | 2200 |
| median | 2767 |
| Q3 | 3754 |
| 95-th percentile | 4551 |
| Maximum | 4883 |
| Range | 2883 |
| Interquartile range (IQR) | 1554 |
Descriptive statistics
| Standard deviation | 851.3066466 |
|---|---|
| Coefficient of variation (CV) | 0.284944435 |
| Kurtosis | -0.918265722 |
| Mean | 2987.623347 |
| Median Absolute Deviation (MAD) | 597 |
| Skewness | 0.6261828591 |
| Sum | 59656863 |
| Variance | 724723.0066 |
| Value | Count | Frequency (%) | |
| 2153 | 169 | 0.8% | |
| 2770 | 146 | 0.7% | |
| 2170 | 140 | 0.7% | |
| 2155 | 136 | 0.7% | |
| 3977 | 128 | 0.6% | |
| 2763 | 125 | 0.6% | |
| 2145 | 125 | 0.6% | |
| 2065 | 117 | 0.6% | |
| 2760 | 112 | 0.6% | |
| 2261 | 109 | 0.5% | |
| Other values (825) | 18661 | 93.3% |
| Value | Count | Frequency (%) | |
| 2000 | 41 | 0.2% | |
| 2007 | 13 | 0.1% | |
| 2008 | 7 | < 0.1% | |
| 2009 | 27 | 0.1% | |
| 2010 | 57 | 0.3% |
| Value | Count | Frequency (%) | |
| 4883 | 9 | < 0.1% | |
| 4879 | 11 | 0.1% | |
| 4878 | 12 | 0.1% | |
| 4877 | 7 | < 0.1% | |
| 4873 | 9 | < 0.1% |
state
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 29 |
| Missing (%) | 0.1% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| NSW | 10200 | 51.0% | |
| VIC | 4541 | 22.7% | |
| QLD | 4262 | 21.3% | |
| New South Wales | 485 | 2.4% | |
| Victoria | 480 | 2.4% | |
| (Missing) | 29 | 0.1% |
Length
| Max length | 15 |
|---|---|
| Median length | 3 |
| Mean length | 3.411061659 |
| Min length | 3 |
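The frequency table above mixes abbreviations (NSW, VIC) with full names (New South Wales, Victoria) for what appear to be the same states. One way to reconcile them before further analysis is a simple lookup. The mapping and helper below are illustrative assumptions, not part of the profiled pipeline.

```python
from collections import Counter

# Hypothetical mapping from full state names to the abbreviations already in use.
STATE_ALIASES = {"New South Wales": "NSW", "Victoria": "VIC"}

def normalize_state(value):
    """Map a full state name to its abbreviation; leave other values (and None) as-is."""
    return STATE_ALIASES.get(value, value)

# Counts copied from the frequency table above.
observed = Counter({
    "NSW": 10200,
    "VIC": 4541,
    "QLD": 4262,
    "New South Wales": 485,
    "Victoria": 480,
})

# Re-aggregate the counts under the normalized labels.
merged = Counter()
for label, count in observed.items():
    merged[normalize_state(label)] += count

print(merged)
```

After normalization the column would have three distinct values instead of five.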
country
Categorical
| Distinct count | 1 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 29 |
| Missing (%) | 0.1% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| Australia | 19968 | 99.9% | |
| (Missing) | 29 | 0.1% |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.991298695 |
| Min length | 3 |
property_valuation
Real number (ℝ≥0)
| Distinct count | 12 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 29 |
| Missing (%) | 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.516376201923077 |
|---|---|
| Minimum | 1.0 |
| Maximum | 12.0 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 156.2 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 6 |
| median | 8 |
| Q3 | 10 |
| 95-th percentile | 11 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.824782869 |
|---|---|
| Coefficient of variation (CV) | 0.3758171216 |
| Kurtosis | -0.3215331451 |
| Mean | 7.516376202 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.6434549432 |
| Sum | 150087 |
| Variance | 7.979398256 |
| Value | Count | Frequency (%) | |
| 8 | 3342 | 16.7% | |
| 9 | 3260 | 16.3% | |
| 10 | 2850 | 14.3% | |
| 7 | 2371 | 11.9% | |
| 11 | 1396 | 7.0% | |
| 6 | 1181 | 5.9% | |
| 5 | 1130 | 5.7% | |
| 4 | 1070 | 5.4% | |
| 12 | 971 | 4.9% | |
| 3 | 903 | 4.5% | |
| Other values (2) | 1494 | 7.5% |
| Value | Count | Frequency (%) | |
| 1 | 807 | 4.0% | |
| 2 | 687 | 3.4% | |
| 3 | 903 | 4.5% | |
| 4 | 1070 | 5.4% | |
| 5 | 1130 | 5.7% |
| Value | Count | Frequency (%) | |
| 12 | 971 | 4.9% | |
| 11 | 1396 | 7.0% | |
| 10 | 2850 | 14.3% | |
| 9 | 3260 | 16.3% | |
| 8 | 3342 | 16.7% |
Label
Boolean
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 156.2 KiB |
| Value | Count | Frequency (%) | |
| 1 | 10535 | 52.7% | |
| 0 | 9462 | 47.3% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
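As a minimal sketch of that calculation (plain Python rather than the report's own implementation, with an illustrative function name):

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator.
    return cov / math.sqrt(var_x * var_y)

x = [1.0, 2.0, 3.0, 4.0, 5.0]
print(pearson_r(x, [2 * v + 1 for v in x]))    # perfect positive linear relation
print(pearson_r(x, [-3 * v + 10 for v in x]))  # perfect negative linear relation
print(pearson_r(x, [10 * v - 7 for v in x]))   # a different slope and offset, same r
```

The first and third calls give the same result, which illustrates the invariance under changes of location and scale.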
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at capturing nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
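A minimal sketch of that recipe follows: rank both variables, then apply the covariance-over-standard-deviations formula to the ranks. Giving tied values the average of their positions is a common convention and is assumed here; the function names are illustrative.

```python
import math

def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1  # positions i..j are 0-based
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def spearman_rho(x, y):
    """Pearson correlation computed on the rank variables of x and y."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mean_x, mean_y = sum(rx) / n, sum(ry) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(rx, ry))
    var_x = sum((a - mean_x) ** 2 for a in rx)
    var_y = sum((b - mean_y) ** 2 for b in ry)
    return cov / math.sqrt(var_x * var_y)

x = [1, 2, 3, 4, 5]
print(spearman_rho(x, [v ** 3 for v in x]))  # monotonic but nonlinear relation
print(average_ranks([10, 20, 20, 30]))       # the tied values share one rank
```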
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
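The pair-counting definition above can be sketched directly. This is the simplest variant, with no correction for ties (often called tau-a); library implementations commonly apply a tie-corrected variant instead, so results can differ when the data contain ties. The function name is illustrative.

```python
from itertools import combinations

def kendall_tau(x, y):
    """(concordant - discordant) / total pairs, with no tie correction."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # Positive product: both variables move in the same direction.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

print(kendall_tau([1, 2, 3, 4], [1, 2, 3, 4]))  # all pairs concordant
print(kendall_tau([1, 2, 3, 4], [4, 3, 2, 1]))  # all pairs discordant
print(kendall_tau([1, 2, 3, 4], [1, 3, 2, 4]))  # five concordant, one discordant
```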
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.
First rows
| | transaction_id | product_id | customer_id | transaction_date | online_order | order_status | brand | product_line | product_class | product_size | list_price | standard_cost | Margin | product_first_sold_date | first_name | last_name | gender_encoded | past_3_years_bike_related_purchases | DOB | Age | Age Scale | job_title | job_industry_category | job_industry_category_encoded | wealth_segment_encoded | deceased_indicator | owns_car_encoded | tenure | address | postcode | state | country | property_valuation | Label |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 2638 | 83 | 34 | 2017-04-07 | 0.0 | Approved | Solex | Touring | medium | large | 2083.94 | 675.03 | 1408.91 | 2013-09-16 | Jephthah | Bachmann | 0 | 59 | 1843-12-21 | 150 | 1.0 | Legal Assistant | IT | -5 | 0 | N | 0 | 20.0 | 833 Luster Way | 4005.0 | QLD | Australia | 8.0 | 1 |
| 1 | 9044 | 12 | 34 | 2017-02-13 | 0.0 | Approved | WeareA2B | Standard | medium | medium | 1231.15 | 161.60 | 1069.55 | 2004-08-17 | Jephthah | Bachmann | 0 | 59 | 1843-12-21 | 150 | 1.0 | Legal Assistant | IT | -5 | 0 | N | 0 | 20.0 | 833 Luster Way | 4005.0 | QLD | Australia | 8.0 | 1 |
| 2 | 16935 | 0 | 34 | 2017-02-14 | 0.0 | Approved | NaN | NaN | NaN | NaN | 1034.17 | NaN | 1034.17 | NaT | Jephthah | Bachmann | 0 | 59 | 1843-12-21 | 150 | 1.0 | Legal Assistant | IT | -5 | 0 | N | 0 | 20.0 | 833 Luster Way | 4005.0 | QLD | Australia | 8.0 | 1 |
| 3 | 19291 | 65 | 34 | 2017-09-19 | 0.0 | Approved | WeareA2B | Standard | medium | medium | 1807.45 | 778.69 | 1028.76 | 2015-05-21 | Jephthah | Bachmann | 0 | 59 | 1843-12-21 | 150 | 1.0 | Legal Assistant | IT | -5 | 0 | N | 0 | 20.0 | 833 Luster Way | 4005.0 | QLD | Australia | 8.0 | 1 |
| 4 | 12083 | 13 | 34 | 2017-07-23 | 0.0 | Approved | Solex | Standard | medium | medium | 1163.89 | 589.27 | 574.62 | 2016-07-09 | Jephthah | Bachmann | 0 | 59 | 1843-12-21 | 150 | 1.0 | Legal Assistant | IT | -5 | 0 | N | 0 | 20.0 | 833 Luster Way | 4005.0 | QLD | Australia | 8.0 | 1 |
| 5 | 9792 | 60 | 34 | 2017-06-25 | 1.0 | Approved | Giant Bicycles | Standard | high | small | 1977.36 | 1759.85 | 217.51 | 2011-08-24 | Jephthah | Bachmann | 0 | 59 | 1843-12-21 | 150 | 1.0 | Legal Assistant | IT | -5 | 0 | N | 0 | 20.0 | 833 Luster Way | 4005.0 | QLD | Australia | 8.0 | 1 |
| 6 | 1107 | 15 | 34 | 2017-08-22 | 0.0 | Approved | Norco Bicycles | Standard | low | medium | 958.74 | 748.90 | 209.84 | 2005-12-07 | Jephthah | Bachmann | 0 | 59 | 1843-12-21 | 150 | 1.0 | Legal Assistant | IT | -5 | 0 | N | 0 | 20.0 | 833 Luster Way | 4005.0 | QLD | Australia | 8.0 | 1 |
| 7 | 1039 | 8 | 34 | 2017-07-01 | 1.0 | Approved | Solex | Road | medium | small | 1703.52 | 1516.13 | 187.39 | 2011-04-16 | Jephthah | Bachmann | 0 | 59 | 1843-12-21 | 150 | 1.0 | Legal Assistant | IT | -5 | 0 | N | 0 | 20.0 | 833 Luster Way | 4005.0 | QLD | Australia | 8.0 | 1 |
| 8 | 17808 | 96 | 34 | 2017-04-10 | 1.0 | Approved | WeareA2B | Road | low | small | 1172.78 | 1043.77 | 129.01 | 2002-10-10 | Jephthah | Bachmann | 0 | 59 | 1843-12-21 | 150 | 1.0 | Legal Assistant | IT | -5 | 0 | N | 0 | 20.0 | 833 Luster Way | 4005.0 | QLD | Australia | 8.0 | 1 |
| 9 | 15760 | 3 | 168 | 2017-03-18 | 0.0 | Approved | Trek Bicycles | Standard | medium | large | 2091.47 | 388.92 | 1702.55 | 2010-11-05 | Reggie | Broggetti | 0 | 8 | NaN | 120 | 0.8 | General Manager | IT | -5 | 0 | N | 1 | NaN | 16 Golf View Center | 3020.0 | VIC | Australia | 6.0 | 1 |
Last rows
| | transaction_id | product_id | customer_id | transaction_date | online_order | order_status | brand | product_line | product_class | product_size | list_price | standard_cost | Margin | product_first_sold_date | first_name | last_name | gender_encoded | past_3_years_bike_related_purchases | DOB | Age | Age Scale | job_title | job_industry_category | job_industry_category_encoded | wealth_segment_encoded | deceased_indicator | owns_car_encoded | tenure | address | postcode | state | country | property_valuation | Label |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 19987 | 6671 | 31 | 1250 | 2017-03-21 | 0.0 | Approved | Giant Bicycles | Standard | medium | medium | 230.91 | 173.18 | 57.73 | 2006-11-10 | Jacklyn | Kewley | 0 | 42 | 2001-11-02 00:00:00 | 18 | 0.12 | Help Desk Technician | Manufacturing | 1 | -1 | N | 0 | 1.0 | 795 Arapahoe Hill | 4818.0 | QLD | Australia | 7.0 | 0 |
| 19988 | 3025 | 97 | 245 | 2017-12-18 | 1.0 | Approved | Solex | Standard | medium | large | 202.62 | 151.96 | 50.66 | 2016-03-29 | Noell | Grahlmans | 0 | 6 | 2001-09-26 00:00:00 | 18 | 0.12 | Associate Professor | Financial Services | -3 | 0 | N | 0 | 1.0 | 07227 Hoard Terrace | 3500.0 | VIC | Australia | 1.0 | 0 |
| 19989 | 13374 | 97 | 751 | 2017-08-22 | 1.0 | Approved | Solex | Standard | medium | large | 202.62 | 151.96 | 50.66 | 2016-03-29 | Amie | Dufty | 0 | 41 | 2001-10-31 00:00:00 | 18 | 0.12 | Business Systems Development Analyst | Financial Services | -3 | 0 | N | 0 | 1.0 | 5 Dahle Trail | 2117.0 | NSW | Australia | 10.0 | 0 |
| 19990 | 9764 | 97 | 422 | 2017-02-10 | 0.0 | Approved | Solex | Standard | medium | large | 202.62 | 151.96 | 50.66 | 2016-03-29 | Vito | Norker | 1 | 78 | 2002-01-06 00:00:00 | 18 | 0.12 | NaN | Manufacturing | 1 | 0 | N | 0 | 1.0 | 509 Fisk Hill | 2031.0 | NSW | Australia | 11.0 | 0 |
| 19991 | 4712 | 56 | 751 | 2017-04-27 | 0.0 | Approved | OHM Cycles | Standard | medium | medium | 183.86 | 137.90 | 45.96 | 1997-10-04 | Amie | Dufty | 0 | 41 | 2001-10-31 00:00:00 | 18 | 0.12 | Business Systems Development Analyst | Financial Services | -3 | 0 | N | 0 | 1.0 | 5 Dahle Trail | 2117.0 | NSW | Australia | 10.0 | 0 |
| 19992 | 5394 | 56 | 1402 | 2017-02-14 | 1.0 | Approved | OHM Cycles | Standard | medium | medium | 183.86 | 137.90 | 45.96 | 1997-10-04 | Hillier | Andraud | 1 | 58 | 2001-12-08 00:00:00 | 18 | 0.12 | Assistant Professor | Telecommunications | 5 | -1 | N | 0 | 1.0 | 42829 Charing Cross Road | 3107.0 | VIC | Australia | 8.0 | 1 |
| 19993 | 14236 | 0 | 1519 | 2017-08-18 | 1.0 | Approved | Solex | Standard | medium | medium | 71.49 | 53.62 | 17.87 | 2004-09-28 | Marwin | Jeyness | 1 | 35 | 2001-11-30 00:00:00 | 18 | 0.12 | Administrative Assistant IV | Telecommunications | 5 | 1 | N | 1 | 1.0 | 7 Bartillon Circle | 2260.0 | NSW | Australia | 8.0 | 0 |
| 19994 | 11274 | 2 | 442 | 2017-06-11 | 1.0 | Approved | Solex | Standard | medium | medium | 71.49 | 53.62 | 17.87 | 2012-12-02 | Linc | Vedyasov | 1 | 2 | 2001-10-06 00:00:00 | 18 | 0.12 | NaN | Financial Services | -3 | -1 | N | 0 | 1.0 | 3 Sutteridge Park | 4074.0 | QLD | Australia | 6.0 | 0 |
| 19995 | 7389 | 61 | 1250 | 2017-11-25 | 1.0 | Approved | OHM Cycles | Standard | low | medium | 71.16 | 56.93 | 14.23 | 2015-06-17 | Jacklyn | Kewley | 0 | 42 | 2001-11-02 00:00:00 | 18 | 0.12 | Help Desk Technician | Manufacturing | 1 | -1 | N | 0 | 1.0 | 795 Arapahoe Hill | 4818.0 | QLD | Australia | 7.0 | 0 |
| 19996 | 1048 | 19 | 2759 | 2017-08-16 | 1.0 | Approved | OHM Cycles | Road | high | large | 12.01 | 7.21 | 4.80 | 1999-06-23 | Melodee | Hendrik | 0 | 16 | 2001-11-14 00:00:00 | 18 | 0.12 | Operator | Health | -4 | 0 | N | 1 | 1.0 | 68111 Bartillon Court | 3995.0 | VIC | Australia | 3.0 | 1 |